FMA: A Dataset for Music Analysis
نویسندگان
چکیده
We introduce the Free Music Archive (FMA), an open and easily accessible dataset which can be used to evaluate several tasks in music information retrieval (MIR), a field concerned with browsing, searching, and organizing large music collections. The community’s growing interest in feature and end-to-end learning is however restrained by the limited availability of large audio datasets. By releasing the FMA, we hope to foster research which will improve the state-of-the-art and hopefully surpass the performance ceiling observed in e.g. genre recognition (MGR). The data is made of 106,574 tracks, 16,341 artists, 14,854 albums, arranged in a hierarchical taxonomy of 161 genres, for a total of 343 days of audio and 917 GiB, all under permissive Creative Commons licenses. It features metadata like song title, album, artist and genres; user data like play counts, favorites, and comments; free-form text like description, biography, and tags; together with fulllength, high-quality audio, and some pre-computed features. We propose a train/validation/test split and three subsets: a genre-balanced set of 8,000 tracks from 8 major genres, a genre-unbalanced set of 25,000 tracks from 16 genres, and a 98 GiB version with clips trimmed to 30s. This paper describes the dataset and how it was created, proposes some tasks like music classification and annotation or recommendation, and evaluates some baselines for MGR. Code, data, and usage examples are available at https://github.com/mdeff/fma.
منابع مشابه
Automatic Identification and Classification of the Iranian Traditional Music Scales (Dastgāh) and Melody Models (Gusheh): Analytical and Comparative Review on Conducted Research
Background and Aim: Automatic identification and classification of the Iranian traditional music scales (Dastgāh) and melody models (Gusheh) has attracted the attention of the researchers for more than a decade. The current research aims to review conducted researches on this area and consider its different approached and obstacles. Method: The research approach is content analysis and data col...
متن کاملLarge Scale Metric Learning for Music Similarity
Automatic music classification is an important problem in music analysis. In this paper, we use data from the “Million Song Dataset” to construct and evaluate music similarity metrics and metric learning techniques. While others have done metric learning on music datasets before, we evaluate which standard techniques perform best in both accuracy and time on this much larger dataset. We find th...
متن کاملFrequentist Model Averaging with missing observations
Model averaging or combining is often considered as an alternative to model selection. Frequentist model averaging (FMA) is considered extensively and strategies for the application of FMA methods in the presence of missing data based on two distinct approaches are presented. The first approach combines estimates from a set of appropriate models which are weighted by scores of a missing data ad...
متن کاملA Study on Cross-cultural and Cross-dataset Generalizability of Music Mood Regression Models
The goal of music mood regression is to represent the emotional expression of music pieces as numerical values in a low-dimensional mood space and automatically predict those values for unseen music pieces. Existing studies on this topic usually train and test regression models using music datasets sampled from the same culture source, annotated by people with the same cultural background, or o...
متن کاملTranslation, Validity, and Reliability of the Upper Extremity Fugl-Meyer Assessment (FMA-UE) in Persian Speaking Stroke Patients
Objectives: Fugl-Meyer Assessment (FMA) is the common scale for clinical and functional evaluation of sensorimotor conditions and related Upper Extremity (UE) dysfunction after stroke. This study was done to translate and cross-culturally adjust the original upper extremity FMA (FMA-UE) into Persian and to evaluate the psychometric properties of the translated version. Methods: A procedure of...
متن کامل